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AMENDMENTS TO THE CLAIMS: 

1. (Currently amended) A system for identifying genes, comprising: 
a pattern database comprising patterns of amino acids; 

an input device for inputting a genomic DNA sequence; and 
a processor which: 

translates an open reading frame (ORF) of said DNA sequence into an Eimino acid 

translation; 

locates in said amino acid translation occurrences of said patterns from said 
pattern database; and 

determines whether said open reading frame includes at least part of a putative 
gene based on a number of said patterns of amino acids located in said amino acid translation of 

said ORF, and/or weighted values associated with iiaid occurrences of said patterns of amino 
acids located in said amino acid translation of said ORF. 

2. (Currently amended) The system according to claim 1, wherein said processor translates a 
plurality of open reading frames in said DNA sequence into amino acid translations, and locates 
in each amino acid translation occurrences of said patterns to determine whether each said 
plurality open reading frames includes at least pait of a putative gene. 

3. (Previously presented) The system according to claim 2, wherein said patterns comprise 
biologically significant patterns of amino acids in amino acid sequences. 

4. (Previously presented) The system according to claim 1, wherein said processor identifies 
a match of a pattern from said pattern database in said amino acid translation. 

5. (Original) The system according to claim 4, wherein said patterns are derived from a 
parent database comprising at least one amino acid sequence. 
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6. (Original) The system according to claim 4, wherein said patterns are derived from a 
parent database comprising at least one amino acid sequence fragment. 

7. (Original) The system according to claim 4, wherein said patterns are derived by using a 
pattern discovery algorithm. 

8. (Original) The system according to claim 4, wherein said patterns are derived by using the 
Teiresias algorithm. 

9. (Original) The system according to claim 4, wherein said ORF comprises a portion of 
said DNA sequence between a start codon and a stop codon. 

10. (Currently amended) The system according to claim 4, wherein said processor reports 
said ORP as at least part of a putative gene when a predetermined number of pattern matches is 
identified in said amino acid translation. 

1 1 . (Currently amended) The system according to claim 4, wherein each pattern is assigned a 
weight depending upon a relevance of said pattern in determining whether said ORF comprises at 
least part of a putative gene. 

12. (Currently amended) The system according to claim 4, wherein said ORF is reported as at 
l e ast part of a putative gene when the sum of weights corresponding to all patterns with matches 

in said amino acid translation exceeds a predetermined thi"eshold. 

13. (Original) The system according to claim 4, wherein said match is identified using a 
predetermined pattern matching algorithm. 
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14. (Original) The system according to claim 4, further comprising: 

a memory device for storing data and instmctions to be executed by said processor. 

15. (Original) The system according to claim 4, further comprising: 
a display device for displaying an output from said processor. 

16. (Currently amended) A method of identifying genes, comprising: 
providing a pattern database comprising patterns of amino acids; 
determining an open reading frame (ORF) in a genomic DNA sequence; 
generating an amino acid translation for said ORF; 

locating a match of a pattern from said pattern database in said amino acid trzinslation; 

determining whether said ORF includes at least part of a putative gene based on a number 
of said matching patterns of amino acids located in said amino acid translation of said ORF. 
and/or a weighted value associated with a matching said match of said pattern located in said 
amino acid translation of said ORF; and 

displaying a result of said determining whether said ORF includes at least part of a 
putative gene. 

17. (Original) The method according to claim 16, wherein said pattern database is generated 
from a database comprising at least one amino acid sequence. 

18. (Original) The method according to claim 16, wherein said pattern database is generated 
from a database comprising at least one amino acid sequence fragment. 

19. (Currently amended) The method according to claim 16, further comprising: 
identifying said ORF as at least part of a putative gene when a predetermined number of 

pattern matches is identified in said amino acid translation. 
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20. (Currently amended) The method according to claim 16, further comprising: 
assigning a weight to each pattern depending upon a relevance of said pattern in 

determining whether said ORF includes at least part of a putative gene. 

21. (Original) The method according to claim 16, further comprising: 
displaying said match of said pattern in said amino acid translation. 

22. (Original) The method according to claim 16, wherein said pattern database is generated 
using the Teiresias algorithm to derive said patterns from a parent database. 

23. (Currently amended) A programmable storage medium tangibly embodying a program of 
machine-readable instructions executable by a digital processing apparatus to perform a method 
for identifying genes, said method comprising: 

providing a pattern database comprising patterns of amino acids; 

determining an open reading frames (ORF) in a given genomic DNA sequence; 

generating an amino acid translation for each ORF; 

locating a match of a pattern from said pattern database in said amino acid 

translation; 

determining whether said ORF includes at least pait of a putative gene based on a 
number of matching patterns of amino acids located in said amino acid translation of said ORF, 
and/or a weighted value associated with a matching said match of said pattern located in said 
amino acid translation of said ORF; and 

displaying a result of said determining whether said ORF includes at least part of 
a putative gene. 

24. (Currently amended) The system according to claim 3, wherein said processor 
determines for each pattern in said pattern database whether the pattern is present in said amino 
acid translation by locating instances of said patterns in said amino acid translation, until a sum 
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of weights corresponding to all patterns with matches in said amino acid translation exceeds a 
predetermined threshold, at which point said processor identifies said ORF as at least part of a 
putative gene. 

25. (Previously presented) The system according to claim 1, further comprising: 

a parent database comprising a plurality of Eimino acid sequences, said patterns in sziid 
pattern database being derived from said plurality of amino acid sequences by using a pattern 
discovery algorithm; 

a memory device for storing data and instructions to be executed by said processor; £ind 
a display device for displaying an output from said processor. 

26. (Currently amended) The system according to cMm 25, wherein said open reading frame 
(ORF) comprises a portion of said DNA sequence between a start codon and a stop codon, 

wheiein said processor identifies a match of a pattern from said pattern database in said 
amino acid translation by using a predetermined pattern matching algorithm, 

wherein each pattern is assigned a weight depending upon a relevance of said pattern in 
determining whether said ORF comprises at least part of a putative gene, and 

wherein said ORF is reported as at least part of a putative gene when either a 
predetermined number of pattern matches is identified in said amino acid translation, or a sum of 
weights corresponding to all patterns with matches in said amino acid translation exceeds a 
predetermined threshold. 

27. (Previously presented) The system according to claim 1, wherein said processor accesses 
said pattern database to retrieve said patterns from said pattern database. 

28. (Previously presented) The system according to claim 1, wherein said processor is 
electrically coupled to said input device and said pattern database. 
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29. (Currently amended) A system for identifying genes, comprising: 
an input device which inputs a genomic DNA sequence; and 
a processor which: 

accesses a pattern database comprising a plurality of patterns of zimino acids; 
translates an open reading frame (ORF) of said DNA sequence into an £imino acid 

translation; 

locates in said amino acid translation occurrences of said patterns from said 
pattern database; and 

determines whether said open reading frame includes at least part of a putative 
gene based on a number of said patterns of amino acids located in said amino acid translation of 
said ORF. and/or weighted values associated with said occurrences of said patterns of amino 
acids located in said amino acid translation of said ORF. 



30. (Currently amended) A system for identifying genes, comprising: 
an input device which inputs a query genomic DNA sequence; 
a processor which: 

accesses a pattern database comprising a plurality of patterns of amino acids; 
translates an open reading frame (ORF) of said DNA sequence into an amino acid 

translation; and 

locates in said amino acid translation occurrences of said patterns from said 
pattern database; 

determines whether said open reading frame includes at least part of a putative 
gene based on a number of said patterns of amino acids located in said amino acid translation of 
said ORF, and/or weighted values associated with said occurrences of said patterns of amino 
acids located in said amino acid translation of said ORF; and 

a display device for displaying an output of said processor, said output including an 
occurrence of said patterns in said amino acid translation, 

wherein said patterns comprises patterns derived using a Teiresias algorithm. 
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wherein said open reading frame (ORF) comprises a portion of said DNA sequence 
between a start codon and a stop codon, 

wherein said processor identifies a match of a pattern from said pattern database in said 
amino acid translation by using a predetermined pattern matching algorithm, 

wherein each pattern is assigned a weight depending upon a relevEince of said pattern in 
determining whether said ORF comprises at least part of a putative gene, and 

wherein said ORF is reported as at least part of a putative gene when either a 
predetermined number of pattern matches is identified in said amino acid translation, or a sum of 
weights corresponding to all patterns with matches in said amino acid translation exceeds a 
predetermined threshold. 



